SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Wahl , Geoffrey M 

0' Gorman, Stephen V 

(ii) TITLE OF INVENTION: FLP- MEDIATED GENE MODIFICATION IN 
MAMMALIAN CELLS, AND COMPOSITIONS AND CELLS USEFUL 
THEREFOR 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pretty, Schroeder, Brueggemann & Clark 

(B) STREET: 444 South Flower Street, Suite 2000 

(C) CITY: Los Angeles 

(D) STATE: California 

( E ) COUNTRY : USA 

(F) ZIP : 90071 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/486,409 

(B) FILING DATE: 07-JUN-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Reiter, Stephen E 

(B) REGISTRATION NUMBER: 31,192 

(C) REFERENCE/DOCKET NUMBER: P41 90004 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619) 546-1995 

(B) TELEFAX: (619) 546-9392 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: NATIVE FLP 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1269 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATG CCA CAA TTT GAT ATA TTA TGT AAA ACA CCA CCT AAG GTG CTT GTT 4 8 

Met Pro Gin Phe Asp lie Leu Cys Lys Thr Pro Pro Lys Val Leu Val 

15 10 15 

CGT CAG TTT GTG GAA AGG TTT GAA AGA CCT TCA GGT GAG AAA ATA GCA 96 

Arg Gin Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys lie Ala 

20 25 30 

TTA TGT GCT GCT GAA CTA ACC TAT TTA TGT TGG ATG ATT ACA CAT AAC 144 

Leu Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met lie Thr His Asn 

35 40 45 

GGA ACA GCA ATC AAG AGA GCC ACA TTC ATG AGC TAT AAT ACT ATC ATA 192 

Gly Thr Ala lie Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr lie lie 

50 55 60 

AGC AAT TCG CTG AGT TTC GAT ATT GTC AAT AAA TCA CTC CAG TTT AAA 24 0 

Ser Asn Ser Leu Ser Phe Asp lie Val Asn Lys Ser Leu Gin Phe Lys 

65 70 75 80 

TAC AAG ACG CAA AAA GCA ACA ATT CTG GAA GCC TCA TTA AAG AAA TTG 288 

Tyr Lys Thr Gin Lys Ala Thr lie Leu Glu Ala Ser Leu Lys Lys Leu 

85 90 95 

ATT CCT GCT TGG GAA TTT ACA ATT ATT CCT TAC TAT GGA CAA AAA CAT 336 

lie Pro Ala Trp Glu Phe Thr lie lie Pro Tyr Tyr Gly Gin Lys His 

100 105 110 

CAA TCT GAT ATC ACT GAT ATT GTA AGT AGT TTG CAA TTA CAG TTC GAA ' 3 84 

Gin Ser Asp lie Thr Asp lie Val Ser Ser Leu Gin Leu Gin Phe Glu 

115 120 125 

TCA TCG GAA GAA GCA GAT AAG GGA AAT AGC CAC AGT AAA AAA ATG CTT 432 

Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu 

130 135 140 

AAA GCA CTT CTA AGT GAG GGT GAA AGC ATC TGG GAG ATC ACT GAG AAA 48 0 

Lys Ala Leu Leu Ser Glu Gly Glu Ser lie Trp Glu lie Thr Glu Lys 

145 150 155 160 

ATA CTA AAT TCG TTT GAG TAT ACT TCG AGA TTT ACA AAA ACA AAA ACT 52 8 

lie Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr 

165 170 175 

TTA TAC CAA TTC CTC TTC CTA GCT ACT TTC ATC AAT TGT GGA AGA TTC 5 76 

Leu Tyr Gin Phe Leu Phe Leu Ala Thr Phe lie Asn Cys Gly Arg Phe 

180 185 190 

AGC GAT ATT AAG AAC GTT GAT CCG AAA TCA TTT AAA TTA GTC CAA AAT 6 24 

Ser Asp lie Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn 

195 200 205 

AAG TAT CTG GGA GTA ATA ATC CAG TGT TTA GTG ACA GAG ACA AAG ACA 6 72 
Lys Tyr Leu Gly Val lie lie Gin Cys Leu Val Thr Glu Thr Lys Thr 

210 215 220 

AGC GTT AGT AGG CAC ATA TAC TTC TTT AGC GCA AGG GGT AGG ATC GAT 72 0 

Ser Val Ser Arg His lie Tyr Phe Phe Ser Ala Arg Gly Arg lie Asp 

225 230 235 240 
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CCA CTT GTA TAT TTG GAT GAA TTT TTG AGG AAT TCT GAA CCA GTC CTA 76 8 

Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu 
245 250 255 

AAA CGA GTA AAT AGG ACC GGC AAT TCT TCA AGC AAT AAA CAG GAA TAC 816 
Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tyr 
260 265 270 

CAA TTA TTA AAA GAT AAC TTA GTC AGA TCG TAC AAT AAA GCT TTG AAG 864 
Gin Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys 
275 280 285 

AAA AAT GCG CCT TAT TCA ATC TTT GCT ATA AAA AAT GGC CCA AAA TCT 912 
Lys Asn Ala Pro Tyr Ser lie Phe Ala lie Lys Asn Gly Pro Lys Ser 
290 295 300 

CAC ATT GGA AGA CAT TTG ATG ACC TCA TTT CTT TCA ATG AAG GGC CTA 960 
His lie Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu 
305 310 315 320 

ACG GAG TTG ACT AAT GTT GTG GGA AAT TGG AGC GAT AAG CGT GCT TCT 1008 
Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser 
325 330 335 

GCC GTG GCC AGG ACA ACG TAT ACT CAT CAG ATA ACA GCA ATA CCT GAT 1056 
Ala Val Ala Arg Thr Thr Tyr Thr His Gin lie Thr Ala lie Pro Asp 
340 345 350 

CAC TAC TTC GCA CTA GTT TCT CGG TAC TAT GCA TAT GAT CCA ATA TCA 1104 
His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro lie Ser 
355 360 365 

AAG GAA ATG ATA GCA TTG AAG GAT GAG ACT AAT CCA ATT GAG GAG TGG 1152 
Lys Glu Met lie Ala Leu Lys Asp Glu Thr Asn Pro lie Glu Glu Trp 
370 375 380 

CAG CAT ATA GAA CAG CTA AAG GGT AGT GCT GAA GGA AGC ATA CGA TAC 12 00 

Gin His lie Glu Gin Leu Lys Gly Ser Ala Glu Gly Ser lie Arg Tyr 
385 390 395 400 

CCC GCA TGG ATT GGG ATA ATA TCA CAG GAG GTA CTA GAC TAC CTT TCA 124 8 

Pro Ala Trp lie Gly lie lie Ser Gin Glu Val Leu Asp Tyr Leu Ser 
405 410 415 

TCC TAC ATA AAT AGA CGC ATA TAAGTACGCA TTTAAGCATA AACACGCACT 12 99 

Ser Tyr lie Asn Arg Arg lie 
420 

ATCCCGTTCT TCTCATGTAT ATATATATAC AGGCAACACG CAGATATAGG TGCGACGTGA 13 59 

ACAGTGAGCT GTATGTGCGC A 13 80 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



4 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Pro Gin Phe Asp lie Leu Cys Lys Thr Pro Pro Lys Val Leu Val 
15 10 15 

Arg Gin Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys lie Ala 
20 25 30 

Leu Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met lie Thr His Asn 
35 40 45 

Gly Thr Ala lie Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr lie lie 
50 55 60 

Ser Asn Ser Leu Ser Phe Asp lie Val Asn Lys Ser Leu Gin Phe Lys 
65 70 75 80 

Tyr Lys Thr Gin Lys Ala Thr lie Leu Glu Ala Ser Leu Lys Lys Leu 
85 90 95 

lie Pro Ala Trp Glu Phe Thr lie lie Pro Tyr Tyr Gly Gin Lys His 
100 105 110 

Gin Ser Asp lie Thr Asp lie Val Ser Ser Leu Gin Leu Gin Phe Glu 
115 120 125 

Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu 
130 135 140 

Lys Ala Leu Leu Ser Glu Gly Glu Ser lie Trp Glu lie Thr Glu Lys 
145 150 155 160 

lie Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr 
165 170 175 

Leu Tyr Gin Phe Leu Phe Leu Ala Thr Phe lie Asn Cys Gly Arg Phe 
180 185 190 

Ser Asp lie Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn 
195 200 205 

Lys Tyr Leu Gly Val lie lie Gin Cys Leu Val Thr Glu Thr Lys Thr 
210 215 220 

Ser Val Ser Arg His lie Tyr Phe Phe Ser Ala Arg Gly Arg lie Asp 
225 230 235 240 

Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu 
245 250 255 

Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tyr 
260 265 270 

Gin Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys 
275 280 285 

Lys Asn Ala Pro Tyr Ser lie Phe Ala lie Lys Asn Gly Pro Lys Ser 
290 295 300 

His lie Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu 
305 310 315 320 



1 



4 



-33- 

Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser 
325 330 335 

Ala Val Ala Arg Thr Thr Tyr Thr His Gin lie Thr Ala lie Pro Asp 
340 345 350 

His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro lie Ser 
355 360 365 

Lys Glu Met lie Ala Leu Lys Asp Glu Thr Asn Pro lie Glu Glu Trp 
370 375 380 

Gin His lie Glu Gin Leu Lys Gly Ser Ala Glu Gly Ser lie Arg Tyr 
385 390 395 400 

Pro Ala Trp lie Gly lie lie Ser Gin Glu Val Leu Asp Tyr Leu Ser 
405 410 415 

Ser Tyr lie Asn Arg Arg lie 
420 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
GAAGTTCCTA TTCTCTAGAA AGTATAGGAA CTTC 3 4 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GATCCCGGGC T AC C ATGG AG AAGTTCCTAT TCCGAAGTTC CTATT CTCTA GAAAGTATAG 6 0 

GAACTTCA 6 8 



